PyDigger - unearthing stuff about Python


NameVersionSummarydate
swebench 3.0.5 The official SWE-bench package - a benchmark for evaluating LMs on software engineering 2025-02-01 22:33:00
mteb 1.31.8 Massive Text Embedding Benchmark 2025-02-01 16:03:30
pytest-codspeed 3.2.0 Pytest plugin to create CodSpeed benchmarks 2025-01-31 14:28:26
airflow-parse-bench 1.0.1 Easily measure and compare your Airflow DAGs' parse time. 2025-01-26 03:39:23
mrna-bench 1.0.1 Benchmarking suite for mRNA property prediction. 2025-01-23 23:15:44
opencompass 0.4.0 A comprehensive toolkit for large model evaluation 2025-01-22 06:42:16
folktexts 0.0.27 Use LLMs to get classification risk scores on tabular tasks. 2025-01-17 16:27:47
fusion-bench 0.2.9 A Comprehensive Benchmark of Deep Model Fusion 2025-01-17 06:57:43
pydftracer 1.0.8 I/O profiler for deep learning python apps. Specifically for dlio_benchmark. 2024-12-17 03:37:11
rdt 1.13.2 Reversible Data Transforms 2024-12-16 22:46:10
qpbenchmark 2.4.0 Benchmark for quadratic programming solvers available in Python. 2024-12-16 09:24:00
ms-opencompass 0.1.5 A lightweight toolkit for evaluating LLMs based on OpenCompass. 2024-12-16 08:05:22
mlrb-agent-tasks 0.0.23 A task package for ML Research Bench 2024-12-10 16:21:44
EpiLog 1.1.2 Simple No-Frills Logging Manager 2024-12-06 21:32:56
nodespecs 0.1.1 The specs summarize utilities for computer instance 2024-12-06 15:43:45
Younger 0.0.1a2 A Younger Project for Artificial Intelligence: Datasets, Benchmarks, and Applications. 2024-11-25 08:01:45
syntherela 0.0.3 SyntheRela - Synthetic Relational Data Generation Benchmark 2024-11-21 06:20:37
cmdbench 0.1.22 Quick and easy benchmarking for any command's CPU, memory, disk usage and runtime. 2024-11-20 06:53:26
construe 0.2.0 An LLM inferencing benchmark tool focusing on device-specific latency and memory usage 2024-11-13 03:52:24
guardbench 1.0.0 GuardBench: A Large-Scale Benchmark for Guardrail Models 2024-11-12 02:44:56
hourdayweektotal
2712697033284255
Elapsed time: 2.42781s